AlexSys: a knowledge-based expert system for multiple sequence alignment construction and analysis

Authors

  • Mohamed Radhouene Aniba
  • Olivier Poch
  • Aron Marchler-Bauer
  • Julie Dawn Thompson
Abstract

Multiple sequence alignment (MSA) is a cornerstone of modern molecular biology and represents a unique means of investigating the patterns of conservation and diversity in complex biological systems. Many different algorithms have been developed to construct MSAs, but previous studies have shown that no single aligner consistently outperforms the rest. This has led to the development of a number of 'meta-methods' that systematically run several aligners and merge the output into one single solution. Although these methods generally produce more accurate alignments, they are inefficient because all the aligners need to be run first and the choice of the best solution is made a posteriori. Here, we describe the development of a new expert system, AlexSys, for the multiple alignment of protein sequences. AlexSys incorporates an intelligent inference engine to automatically select an appropriate aligner a priori, depending only on the nature of the input sequences. The inference engine was trained on a large set of reference multiple alignments, using a novel machine learning approach. Applying AlexSys to a test set of 178 alignments, we show that the expert system represents a good compromise between alignment quality and running time, making it suitable for high-throughput projects. AlexSys is freely available from http://alnitak.u-strasbg.fr/~aniba/alexsys.
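
The idea of choosing an aligner a priori, from properties of the input sequences alone, can be illustrated with a minimal sketch. This is not the AlexSys inference engine: the abstract says its rules were learned from reference alignments, whereas the rules below are hand-written. The feature set, the thresholds (0.5 and 200), and the labels "fast_aligner" and "accurate_aligner" are all assumptions made up for this illustration.

```python
from dataclasses import dataclass
from itertools import combinations
from statistics import mean


@dataclass(frozen=True)
class InputFeatures:
    """Hypothetical summary of an unaligned sequence set."""
    n_sequences: int
    mean_length: float
    mean_kmer_similarity: float  # 0.0 = no shared k-mers, 1.0 = identical k-mer sets


def kmer_set(seq: str, k: int = 3) -> set[str]:
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of two k-mer sets, a cheap alignment-free proxy for relatedness."""
    ka, kb = kmer_set(a, k), kmer_set(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def extract_features(sequences: list[str]) -> InputFeatures:
    """Compute the features from the raw sequences, before any aligner is run."""
    if len(sequences) < 2:
        raise ValueError("need at least two sequences to align")
    sims = [kmer_similarity(a, b) for a, b in combinations(sequences, 2)]
    return InputFeatures(
        n_sequences=len(sequences),
        mean_length=mean(len(s) for s in sequences),
        mean_kmer_similarity=mean(sims),
    )


def select_aligner(features: InputFeatures) -> str:
    """Pick one aligner label using illustrative, hand-written rules (not learned ones)."""
    if features.mean_kmer_similarity >= 0.5:
        # Closely related sequences: assume a fast method is accurate enough.
        return "fast_aligner"
    if features.n_sequences > 200:
        # Very large sets: assume running time dominates the choice.
        return "fast_aligner"
    # Small, divergent sets: assume a slower, more accurate method pays off.
    return "accurate_aligner"
```

In contrast to a meta-method, only the aligner returned by select_aligner(extract_features(sequences)) would then be run, so the cost of the selection step is the feature extraction rather than one full alignment per candidate aligner.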

Similar resources

Knowledge-based expert systems and a proof-of-concept case study for multiple sequence alignment construction and analysis

The traditional approach to bioinformatics analyses relies on independent task-specific services and applications, using different input and output formats, often idiosyncratic, and frequently not designed to inter-operate. In general, such analyses were performed by experts who manually verified the results obtained at each step in the process. Today, the amount of bioinformatics information c...

An Application of the ABS LX Algorithm to Multiple Sequence Alignment

We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...

Safety Risks Impacts Analysis on Construction Project Objectives Using a Hybrid Model of Fuzzy Expert System and Latin Hyper Cube Sampling

Background and aims: The construction industry has a high rate of fatal and nonfatal injuries all around the world and remains one of the most dangerous occupations. Since project safety and the measurement of danger in the construction industry are crucial subjects, this study aimed to measure the impacts of safety risks on the time and cost objectives of a project using a hybrid method ...

A Dynamic Fuzzy Expert System Based on Maintenance Indicators for Service Type Selection of Machinery

Due to the multiplicity of standards and complex rules, maintenance, repair and servicing of machinery can be done only by fully qualified and proficient experts. Since the knowledge of such experts is not available at all times, using expert systems can help to improve the maintenance process. To address this need and the uncertainty of the maintenance process indicators, this research prop...

A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences

The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA), which is the representative of a PHMM, can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...

Journal title:

Volume 38  Issue 

Pages  -

Publication date 2010